Combining Appearance and Structure from Motion Features for Road Scene Understanding
Abstract
In this paper we present a framework for pixel-wise object segmentation of road scenes that combines motion and appearance features. It is designed to handle street-level imagery such as that on Google Street View and Microsoft Bing Maps. We formulate the problem in a CRF framework in order to probabilistically model the label likelihoods and the a priori knowledge. An extended set of appearance-based features is used, which consists of textons, colour, location and HOG descriptors. A novel boosting approach is then applied to combine the motion and appearance-based features. We also incorporate higher order potentials in our CRF model, which produce segmentations with precise object boundaries. We evaluate our method both quantitatively and qualitatively on the challenging Cambridge-driving Labeled Video dataset. Our approach shows an overall recognition accuracy of 84% compared to the state-of-the-art accuracy of 69%.
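For orientation, a CRF with unary, pairwise and higher-order terms is commonly written as an energy over the pixel labelling. The form below is a generic sketch and not necessarily the exact formulation used in this paper: the symbols $\mathcal{V}$ (pixels), $\mathcal{E}$ (neighbouring pixel pairs), $\mathcal{S}$ (image segments or cliques) and the potentials $\psi$ are assumptions introduced here for illustration.

```latex
% Generic CRF energy (illustrative; symbols are assumed, not taken from the paper)
E(\mathbf{x}) \;=\; \sum_{i \in \mathcal{V}} \psi_i(x_i)
\;+\; \sum_{(i,j) \in \mathcal{E}} \psi_{ij}(x_i, x_j)
\;+\; \sum_{c \in \mathcal{S}} \psi_c(\mathbf{x}_c)
```

Under this reading, the unary term $\psi_i$ would carry the boosted per-pixel label likelihoods built from the motion and appearance features, the pairwise term $\psi_{ij}$ would encode smoothness between neighbouring pixels, and the higher-order term $\psi_c$ would encourage label consistency within a segment, which is the kind of term that can sharpen object boundaries. The segmentation is the labelling $\mathbf{x}$ that minimises $E(\mathbf{x})$.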
Similar resources
Traffic Scene Analysis using Hierarchical Sparse Topical Coding
Analyzing motion patterns in traffic videos can be exploited directly to generate high-level descriptions of the video contents. Such descriptions may further be employed in different traffic applications such as traffic phase detection and abnormal event detection. One of the most recent and successful unsupervised methods for complex traffic scene analysis is based on topic models. In this pa...
Text extraction from scene images by character appearance and structure modeling
In this paper, we propose a novel algorithm to detect text information from natural scene images. Scene text classification and detection are still open research topics. Our proposed algorithm is able to model both character appearance and structure to generate representative and discriminative text descriptors. The contributions of this paper include three aspects: 1) a new character appearanc...
Three-dimensional quantitative structure activity relationship approach series of 3-Bromo-4-(1-H-3-Indolyl)-2, 5-Dihydro-1H-2, 5- Pyrroledione as antibacterial agents
The use of quantitative structure–activity relationships, since its advent, has become increasingly helpful in understanding many aspects of biochemical interactions in drug research. This approach was utilized to explain the relationship of structure with biological activity of antibacterial. For the development of new fungicides against, the quantitative structural–activity relationship (QSAR) an...
Using model-based estimation of dynamic hand poses to imitate human arm function by a robot with Kinect data
Pose estimation is a process to identify how a human body and/or individual limbs are configured in a given scene. Hand pose estimation is an important research topic which has a variety of applications in human-computer interaction (HCI) scenarios, such as gesture recognition, animation synthesis and robot control. However, capturing the hand motion is quite a challenging task due to its high ...
VideoPlus: A Method for Capturing the Structure and Appearance of Immersive Environments
This paper presents a simple approach to capturing the appearance and structure of immersive scenes based on the imagery acquired with an omnidirectional video camera. The scheme proceeds by combining techniques from structure-from-motion with ideas from image-based rendering. An interactive photogrammetric modeling scheme is used to recover the locations of a set of salient features in the sce...